A Maximum Entropy Model of Phonotactics and Phonotactic Learning
Abstract
The study of phonotactics (e.g., the ability of English speakers to distinguish possible words like blick from impossible words like *bnick) is a central topic in phonology. We propose a theory of phonotactic grammars and a learning algorithm that constructs such grammars from positive evidence. Our grammars consist of constraints that are assigned numerical weights according to the principle of maximum entropy. Possible words are assessed by these grammars based on the weighted sum of their constraint violations. The learning algorithm yields grammars that can capture both categorical and gradient phonotactic patterns. The algorithm is not provided with any constraints in advance, but uses its own resources to form constraints and weight them. A baseline model, in which Universal Grammar is reduced to a feature set and an SPE-style constraint format, suffices to learn many phonotactic phenomena. In order to learn nonlocal phenomena such as stress and vowel harmony, it is necessary to augment the model with autosegmental tiers and metrical grids. Our results thus offer novel, learning-theoretic support for such representations. We apply the model to English syllable onsets, Shona vowel harmony, quantity-insensitive stress typology, and the full phonotactics of Wargamay, showing that the learned grammars capture the distributional generalizations of these languages and accurately predict the findings of a phonotactic experiment.
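The scoring scheme described in the abstract can be illustrated with a minimal sketch. It assumes the standard maximum entropy form, in which a word's score is the weighted sum of its constraint violations and its unnormalized value is the exponential of the negated score. The two constraints, their weights, the segment classes, and the spellings `blik` and `bnik` are hypothetical and chosen only for illustration; they are not learned values from the paper. Normalizing over a small candidate set is likewise a simplification made here to keep the example finite.

```python
import math

# Hypothetical natural classes, written as plain sets of segments.
STOPS = {"p", "b", "t", "d", "k", "g"}
NASALS = {"m", "n"}


def bigram_constraint(first, second):
    """Build a constraint that counts adjacent pairs (a, b) with a in first, b in second."""
    def violations(word):
        return sum(1 for a, b in zip(word, word[1:]) if a in first and b in second)
    return violations


# (name, violation counter, weight). Weights are made up for this sketch.
CONSTRAINTS = [
    ("*[stop][nasal]", bigram_constraint(STOPS, NASALS), 4.0),
    ("*[stop][stop]", bigram_constraint(STOPS, STOPS), 3.0),
]


def score(word):
    """Weighted sum of constraint violations; higher means less well-formed."""
    return sum(weight * count(word) for _, count, weight in CONSTRAINTS)


def maxent_value(word):
    """Unnormalized maxent value exp(-score)."""
    return math.exp(-score(word))


def probabilities(candidates):
    """Normalize maxent values over a finite candidate set (an assumption of this sketch)."""
    values = {w: maxent_value(w) for w in candidates}
    z = sum(values.values())
    return {w: v / z for w, v in values.items()}


if __name__ == "__main__":
    for word, p in probabilities(["blik", "bnik"]).items():
        print(word, score(word), round(p, 4))
```

Under these assumed weights, `blik` violates neither constraint while `bnik` incurs one violation of `*[stop][nasal]`, so `bnik` receives the lower probability. Because the weights are real-valued, the same mechanism can express gradient as well as near-categorical differences in well-formedness.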
Similar resources
Phonotactics as phonology: Knowledge of a complex restriction in Dutch
The Dutch lexicon contains very few sequences of a long vowel followed by a consonant cluster, where the second member of the cluster is a non-coronal. We provide experimental evidence that Dutch speakers have implicit knowledge of this gap, which cannot be reduced to the probability of segmental sequences or to word-likeness as measured by neighborhood density. The experiment also suggests tha...
Phonotactics as phonology: Knowledge of a complex constraint in Dutch
The Dutch lexicon contains very few sequences of a long vowel followed by a consonant cluster, where the second member of the cluster is a non-coronal. We provide experimental evidence that Dutch speakers have implicit knowledge of this gap, which cannot be reduced to the probability of segmental sequences or to word-likeness as measured by neighborhood density. The experiment also shows that t...
Spatial Simulation and Land-subsidence Susceptibility Mapping Using Maximum Entropy Model
The aim of this research is spatial simulation and land subsidence susceptibility mapping using a maximum entropy model in Jiroft and Anbarabad Townships. First, land subsidence locations were identified through extensive field surveys, and a land subsidence distribution map was then produced in a geographic information system. Then, each of effective factors on land subsidence occurred i...
Habitat suitability modeling of water birds and waders in Hamun wetland by Maximum Entropy model
Climate change and human activities have increased negative pressure on natural ecosystems. Wetlands are among the ecosystems widely affected by these negative changes. Birds, as part of a wetland's wildlife, have been harmed by the destruction of wetlands, so a large group of them are at risk of extinction. Habitat destruction in wetlands in arid and semi-arid areas has more negative effects on the...